Information-theoretic limits of Bayesian network structure learning

ثبت نشده
چکیده

While there has been a lot of prior work on determining the information-theoretic limits of structure recovery in Markov random fields (MRFs), which are undirected graphical models, characterizing the information-theoretic limits of learning BNs (directed models) is important in its own right for the following reasons. First, unlike MRFs where the undirected graph corresponding to a dependence structure is uniquely determined, multiple DAG structures can encode the same dependence structure in BNs. Therefore, one has to reason about Markov equivalent DAG structures in order to characterize the information-theoretic limits of structure recovery in BNs. Second, the complexity of learning MRFs is characterized in terms of parameters of the joint distribution over nodes, which in turn relates to the overall graph structure, while the complexity of learning BNs is characterized by parameters of local conditional distributions of the nodes. The latter presents a technical challenge, as shown in the paper, when the marginal or joint distribution of the nodes in a BN do not have a closed form solution. A recurring theme in the available literature on information-theoretic limits of learning MRFs, is to construct ensembles of MRFs that are hard to learn and then use the Fano’s inequality to lower bound the estimation error by treating the inference procedure as a communication channel. Santhanam and Wainwright [13] obtained necessary and sufficient conditions for learning pairwise binary MRFs. The necessary and sufficient conditions on the number of samples scaled as O k2 logm and O k3 logm respectively, where k is the maximum node degree. Information theoretic limits of learning Gaussian MRFs was studied by Wang et al. [14] and for walk-summable Gaussian networks, by Anandkumar et al. [17]. In [18], Anandkumar et al. obtain a necessary condition of ⌦(c logm) for structure learning of Erdős-Rényi random Ising models, where c is the average node degree.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Information-theoretic limits of Bayesian network structure learning

In this paper, we study the informationtheoretic limits of learning the structure of Bayesian networks (BNs), on discrete as well as continuous random variables, from a finite number of samples. We show that the minimum number of samples required by any procedure to recover the correct structure grows as Ω (m) and Ω ( k logm+ k 2 /m ) for non-sparse and sparse BNs respectively, where m is the n...

متن کامل

Information-theoretic lower bounds on learning the structure of Bayesian networks

In this paper, we study the information-theoretic limits of learning the structure of Bayesian networks (BNs), on discrete as well as continuous random variables, from a finite number of samples. We show that the minimum number of samples required by any procedure to recover the correct structure grows as Ω (m) and Ω (k logm+ k/m) for non-sparse and sparse BNs respectively, where m is the numbe...

متن کامل

Learning Bayesian Network Structure using Markov Blanket in K2 Algorithm

‎A Bayesian network is a graphical model that represents a set of random variables and their causal relationship via a Directed Acyclic Graph (DAG)‎. ‎There are basically two methods used for learning Bayesian network‎: ‎parameter-learning and structure-learning‎. ‎One of the most effective structure-learning methods is K2 algorithm‎. ‎Because the performance of the K2 algorithm depends on node...

متن کامل

Learning Bayesian Network Structure Using Genetic Algorithm with Consideration of the Node Ordering via Principal Component Analysis

‎The most challenging task in dealing with Bayesian networks is learning their structure‎. ‎Two classical approaches are often used for learning Bayesian network structure;‎ ‎Constraint-Based method and Score-and-Search-Based one‎. ‎But neither the first nor the second one are completely satisfactory‎. ‎Therefore the heuristic search such as Genetic Alg...

متن کامل

An Introduction to Inference and Learning in Bayesian Networks

Bayesian networks (BNs) are modern tools for modeling phenomena in dynamic and static systems and are used in different subjects such as disease diagnosis, weather forecasting, decision making and clustering. A BN is a graphical-probabilistic model which represents causal relations among random variables and consists of a directed acyclic graph and a set of conditional probabilities. Structure...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017